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Abstract. We present the underlying relations between colour-magnitude diagrams 
(CMDs) and synthesis models through the use of stellar luminosity distribution func- 
tions. CMDs studies make a direct use of the stellar luminosity distribution function 
while, in general, synthesis models only use its mean value, even though high-order 
moments can also be obtained. We show that the mean, high-order moments and in- 
tegrated luminosity distribution functions of stellar ensembles are related to the stellar 
luminosity distribution function, within th e formalism of probabilistic synthesis mod- 
els. More details have been yet presented in lCervino & Lur idiana ( 2006) and references 
therein. As a direct application of this formalism, we discuss two key issues. First, in- 
ferences on the upper mass limit of the initial mass function as a function of the total 
mass of clusters. Second, we apply extreme value theory to show that that the cluster 
mass obtained from normalising the IMF between m max and m llp does not provide the 
cluster mass in the case where only one star in this mass range is present, as assumed 
in the IGIMF theory. It provides instead the cluster mass with a 60% probability to 
have a star with mass larger than m max , and we argue that in light of this result the basic 
formulation ofthe IGIMF theory must be revised. 



1. From stars to stellar ensembles and the mass-luminosity relation 

Our basic knowledge of the Universe stems from the light received from observed 
sources. In a first-order approximation (neglecting interactions with the interstellar 
medium and non-stellar components), we can consider two types of sources: individual 
stars and stellar ensembles. Since in this case the emission of an stellar ensemble is just 
the sum of its individual components, we can refer to this emission as integrated light, 
coming from an unresolved system such as a distant cluster, or a pixel/slit/IFU in an 
image of a galaxy. The problem of inferring physical properties from the observed data 
can thus be reduced to the analysis of the observed light of individual stars in terms of 
theoretical stellar models, or, in the case of integrated light, to decompose the integrated 
light into its (stellar) components. 

Obviously, the interpretation and physical inferences that can be obtained from 
the integrated light depends on our physical knowledge of the individual sources that 
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would be present in the ensemble. It implies to have theoretical models which cover 
the emission of all possible stars in the ensemble or, at least, to be able to model the 
emission of the most luminous ones, which are also the more massive in a first order 
approximation. The reason being that they dominate the integrated emission of stellar 
ensembles. However, the evolution and spectra of massive/luminous stars are far from 
being solved problems in astronomy, and constitute a very active research area. The 
impact of mass loss, rotation, magnetic fields, binary interactions, etc, can ch ange the 
physi cal inferences obtained from the observed data (see the contribution by IMassey 
I2010L in these proceedings). 

The fact that integrated light is dominated by the most luminous stars could be an 
advantage for some studies dealing with age inferences. It is, however, a problem if we 
are interested i n inferring the total mass as the statistics of stars present in the field (e.g. 



Salpeter 1955) show that low-luminosity, low-mass stars are the most numerous ones 
and dominate the total mass budget of any ensemble, while high-mass stars are the less 
numerous, with a small contribution to the total mass, but dominate the integrated light. 

The problem we are addressing here, then, is the inference about the total mass 
of the ensemble given its total, integrated luminosity. Homology relations in zero-age 
main sequence stars allow to relate the current stellar mass m(t) and its bolometric 
luminosity (which is extrap olated to the luminosity in any band, wavelength or time 



tx(t) in synthesis codes, see ICervino & Luridianal 120051. and references therein for a 



discussion) by a power law as 

€ A {t) oc m(ty , (1) 

where y has typical value of around 3 for main-sequence stars. 

In the case of unresolved stellar ensembles we are seeking a mass-luminosity re- 
lation with an even simpler functional form: 

£ A (t)/M(t) oc constant , (2) 

where £ A (t) the integrated luminosity in a given band/wavelength of an stellar ensemble 
whose total mass in stars is M{t). We stress the difference in the functional dependence 
of the luminosity in Eq. Q] and Eq. |2l To obtain the se simple rel ations was actually 
the problem that Beatrice Tinsley aimed to solve (e.g. iTins lev 1980), and it leads to the 
development of stellar population synthesis models. 



2. Stellar evolution, CMDs and probabilistic synthesis models 

Stellar evolution theory (in particular evolutionary tracks) predicts the evolution of stars 
with a given initial mass m as a function of time t measured from the so-called zero-age 
main sequence. We note this as, for example, the luminosity of a star as a function of 
time, given its mass: ltradc(f|*n)- During the main sequence phase, the mass-luminosity 
relation is more or less well described by Eq. [T] although such a relation does not apply 
for evolved phases. Provided a large grid of tracks and an interpolation algorithm we 
can transform the luminosity of a set of stars with given initial masses as a function of 
time, (\_ rai ±{t\m), into the evolution of stars at a given time as a function of their initial 
masses, lj sn (m|f) , that is, isochrones (such t ransformation is not always a trivial task, 



we refer to, e.g.. ICervino & Luridianall2005l. for details). 
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At first glance, Colour-Magnitude Diagrams (CMDs) are just a projection of such 
isochrones in a particular plane given by the choice of filters. Until recently, the compar- 
ison of the shape of the observed sequences in these diagrams with a set of isochrones 
provided highly valuable information about physical properties of the ensemble of stars, 
such as ages, distances and metallicities. The comparison of shapes, however, results 
in a number of degeneracies, such as the age-metallicity degeneracy in the sense that 
isochrones of different ages selected at different metallicities have the same shape. One 
has to keep in mind that, besides the observational errors, the location of stars in a CMD 
is a sequence of mass. When a CMD is not just considered as the 2-D locus of stars 
of a given age and metallicity, but as a density structure which takes into account how 
many stars are in each point of the diagram, the degeneracy can be lifted. The reason 
is that the density of stars along an isochrone is a very sensitive probe of both age 
and metallicity. So, taken into account the den sity, not only can the basic parame ters of 



these single stellar populations be inferred (see lHernandez & Vails-Gabaudll20 08j. for in 



a classical Bayesia n way), but also thei r star formation and chemical evolution histories 
(see, for a review, IValls-Gabaudll201ll) . 



A similar formalism for interpreting CMDs taking into account the density is the 
representation of the stellar luminosity distribution function (the distribution of stars 
that share a common, given luminosity). Taking the distribution that provides how 
many stars are born with a given mass m at a given age t, that is, the stellar birth- 
rate b(m, ?|A1), any stellar luminosity distribution function or CMD can be produced. 
The aim of CMD analyses is just the inverse problem, that is, to infer b(m, t) from 
observations. 

It is traditionally assumed (see, e.g.. iTinsieyl fl 980) that the stellar birth rate can be 
decomposed into two separable functions: the stellar initial mass function (IMF) (j>{m), 
and the total amount of gas transformed into stars at a given time, i.e., the star formation 
rate history \fr(M,t) = dM(t)/dt: 

b(m, t\M) dm dt - tf>(m) dm x if/(t\M) dt . (3) 

This decomposition of the stellar birth rate is the basis of most of the actual research 
in the studies of stellar ensembles: the use of isochrones make sense only under the 
assumption that the stellar masses produced in an star formation event do not depend 
on the amount of gas transformed into stars in the event itself nor in previous events. 
Formally, this decomposition with small m odifications remains valid if an initial cluster 
mass function, ICMF ^( Aldus) is included (Weidner et al.ll2010h : 



b(m, Aldus, t\M) dm dAl c i us dt - <f>{m) dm x <p( Aldus) dAl c i us x ^(*|Al c ius) dt , (4) 

as far as an integrated galactic initial mass function, IGIMF, which includes the IMF 
and ICMF functions, can be separated from the star formation history, and the star 
formation history refers to the amount of gas transformed into clusters, Aldus, instead 
of the amount of gas transformed into stars, Al. 

In the following we just consider the IMF instead the IGIMF to obtain the stellar 
luminosity function. Given the IMF and the isochrone at a given time we can obtain the 
distribution function that gives us the probability of obtain a star with a given luminos- 
ity. The implicit assumption is that the IMF is a probability distribution function which 
provides the probability to have a star with a given m ass, but not an exact numb er of 
stars. The stellar luminosity function <pe dt is, trivially, dCervino & Luridianal l2006) 
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(d€(m)\ 

W = <Km) x [-^TJ ' (5) 

The different inflection points in the mass-luminosity relation will yield features 
in the luminosity function which do not appear in the mass function. The mean of the 
distribution (i.e. the mean luminosity of clusters which contain just one star, (_&*)) is 

f mu P (d(im)Y l dttm) f m "P 

<£i*>= l{m)<p{m) — — -^—dm = €{m) <p{m) Am , (6) 
Jm low \ dm j dm J m , ow 

which is the equation solved by synthesis codes. 

Equation [6] shows two important issues that must be considered with caution. The 
first one is related with the derivative of I with the initial mass m. Such derivative is not 
always defined and isochrones have discontinuities in post-main sequence evolutionary 
phases, the so called fast evolutionary phases where Eq. [Qdoes not apply. Actually 
such phases are better described in terms of the lifetime of the phase for a given mass m 
(typically the turn-off mass) rather than with initial masses. This issue leads to different 
algorithms to compute s ynthesis models, either using fuel consu mption theorem or else 
isochrone synthesis (see Buzzonifl989l : lMarigo & Girardill200lL for more details). The 



important point here is that since it is the stellar luminosity distribution function the one 
that must be well sampled, size-of-sample effects may be still present in the sampling 
of fast evolutionary phases, even with a well-sampled IMF. 

The second cautionary note refers to the IMF mass limits themselves. The for- 
mulation and the modelisation of stellar ensembles is valid as far as m up (and formally 
m\ ov/ ) have well-defined values. The mean value depends on both parameters and the 
functional form of the stellar luminosity distribution function. 

When dealing with an ensemble of N stars the corresponding distribution which 
describes the possible integrated luminosities of the ensemble is N times the self- 
convolution of the stellar luminosity function (under random sampling of the stellar 
luminosity distribution fun ction assumption). Then the mean total luminosity is (see 
ICer vino & Luridianall2006l for details): 



(£ N *) =NX(£ U ) =JVx(f> . (7) 

We stress again that, as far as (i) the stellar luminosity distribution function must be 
sampled and (ii) its high luminosity tail is related to the lifetime of particular evolution- 
ary phases, a random sampling hypothesis (for the stellar luminosity function) is still 
valid even under a sorted sampled IMF hypothesis. 

This formalism can equally be applied to the total mass of the ensemble: the total 

mass distribution of an ensemble of N stars is described by N self-convolutions of 

i ■ 1 1 1 j 

the IMF itself (Sel man & Melnickl 12008) and the mean value of the total mass of the 
ensemble is ,/V times the mean mass of the IMF: 



(M N ,) = Nx(Mu) = Nx(m) . (8) 
Remarquably, this implies that 

(£ Nt ) Nx(£ u ) (£ u ) 



{Mm) Nx (Mi*) (M u ) 



constant (9) 
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and hence validates the approach given by Eq. |2] Note that this relation provides a 
valid (averaged) total mass as far as the observed luminosity is a good proxy of the 
mean luminosity of stellar ensembles with N stars, and the age is known. 



3. Size-of-sample issues 

We have shown that synthesis models results are related to CMD analyses through a 
collapse of the underlying distribution that defines the CMD in its moments. Obvi- 
ously, the mean value of the distribution is not the only possible outcome of synthesis 
codes, and high-order mom ents or even the distributions themselves can be obtained 
(ICervino & Luridianal 12006). Working with distribution moments has the advantage of 
providing a first order approximation of the underling distribution. From the Central 
Limit theorem, we know that an infinite self-convolution of a distribution with finite 
limits tends to the Gaussian form. High-order moments allows us to define how many 
stars (and average mass) ar e needed to obtain Gaussia n or quasi-Gaussian integrated 
luminosity distributions (see Cervino & Luridiana 2006, for details). This is the basic 



requirement for x 2 fitting algorithm^]. In particular the second moment of the stellar 
luminosity distribution function (the variance) also allows one to evaluate the relative 
error in the use of an observation as a proxy of the mean of the underling distribution. 
However the situation becomes more complicated when a small number of stars defines 
the integrated luminositjQ. 

In general three cases can be defined: (a) Systems/observations which contain a 
large number of stars yielding gaussian distributions of the theoretical integrated lumi- 
nosity. In this case the observations can be used as proxy of mean values safely. These 
situations typically correspond to galaxies and systems with masses larger than 10 8 M Q 
when all the light from the system is covered by the IFU/slit. (b) Systems/observations 
with a large number of stars to ensure gaussian-like distributions, but not large enough 
to provide a negligible variance. In these cases both the mean value and the variance 
of the underling distribution for each band/wavelength must be considered. This situ- 
ation corresponds to systems typically more massive than 10 5 M Q although the exact 
value depends on the considered band/wavelength, (c) Extreme size-of-sample effects: 
these are systems/observations with a number of stars so small that the distributions of 
the integrated luminosities are not Gaussians. In these cases, the analysis of observed 
data in terms of the mean and high-order moments of the distribution of the integrated 
luminosity is not suitable since they do not provide a proper information on the distri- 
bution (mean and mode differs, variance can be not translated in confidence intervals 
etc.). In this situation, it is more accurate to work directly with integrated luminosity 
distribution functions which can be obtained by self-convolution of the stellar luminos- 
ity distribution function (which has several problems at computational level except for 
a low number of stars) or by the analysis of Monte Carlo simulations (which are more 
economic at the computational level but which lead to some subtle problems, such as 



'Although x 1 seems the simplest tool to make inferences from observed data (the inverse problem) 
we emphasize that there are many different methodolog ies (as we ll as in terpretations) on how to deal 
with inverse problems. For a simple introduction see iTarantolj d2006l) as well as other articles at 
http : //www . ipgp . fr/~ tarantola 

2 Note that small is not just an intrinsic property of the studied system, but also an observational effect: an 
IFU observation intrinsically contains a smaller number of stars than a large aperture observation. 
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increasing the number of simulations when the number of stars in the simulated clusters 
decreases to provide a correct sampling of the underling distributions). 



4. Low Luminosity Limit (LLL), Aldus and m up 

Unfortunately we have no estimation of the total mass of the system before an analysis 
is performed, and simple recipes are useful to know in which of the previous regimes the 
observational data lie. A simple m e thod, called the Low Luminosity Limit (LLL), was 
proposed by ICervino & Luridianal (120041) . The method just compares the luminosity 
of the observed ensemble with the luminosity of the most luminous star assumed in 
the model, that is, the extreme value of the stellar luminosity distribution function (the 
LLL). It provides the lowest luminosity that an stellar ensemble should have without 
been possibly confused with an single star. 

An additional simple test is just the comparison of the observed light from a stellar 
ensemble with the locus of the mean values ob tained by synthesis models and the locus 
of individual stars in color-color diagrams (see lBarker et al.ll2008l as an example): clus- 



ters affected by strong sampling effects (where the mean values are not representative) 
cover the intermediate area between stars and the mean values obtained by synthesis 
models. 

A similar situation is present while making inferences of total masses Ald us for 
studies of the maximum mas s m max that a cluster with total mass Aldus wou ld contain 



with m max - Aldus diagrams dSelman & MelmcMl2008l : IWeidner et al.ll2010l . and refer 
ences therein). The observed maximum stellar mass can be obtained from observations. 
However, Aldus must be inferred under a situation where we know for sure that the IMF 
is not well sampled. In these cases, one has to use all observational constrains (such 
as the number of stars with different masses), run a large enough set of Monte Carlo 
simulations which fulfills the observational constrains, and obtain the distribution of 
possible total masses. Note that the usual method of correcting the total mass in the 
unobserved mass range with the mass obtained from a truncated IMF from «ii ow to the 
minimum observed mass implicitly assumes that the IMF is fully sampled in the 
low mass interval (which is equivalent to say that the IMF is sorted sampled up to m ~? n ). 

Finally, we address the problem of the inference of m up from observations. The 
probability of an extreme value, say m up , is precisely the subject of Extreme Value The- 
ory (EVT). This branch of statistics deals with extreme deviations (maxima or minima) 
from the median and it has important consequences in every day life (such as economic 
crashes) and natural catastrophes produced by deviations in annual flood flows, precip- 
itation maxima or earthquakes (als o with human-l ife and economic implications). The 
classical referen ce in the subject is iGumbel (11958 ). although we also suggest reading 
Sornettd ( 20041) (especially pp. 18-23), where the case m np = oo is d i scussed in de 



tail, and where results obtained in as t ronom y (e.g. Ivan Al bada (1968); Oe y & Clarke 



(2005]); ifl amm- Altenburg & Kroupal (120081) ) can be also be found. A detailed discus 



sion is presented in a forthcoming paper. 

Within this E VT framework, we an alyse the basic assumption made in the IGIMF 
theory proposed by lWeidner et all (1201 dl) and previous works. Let us consider a sample 
of N stars. An event with probability p occurs typically Nxp times, hence if we expect 
just 1 star in the mass range m max - m up in the cluster we have: 
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(f>(m)dm; Np = 1; p = — 



1 

N 



(10) 



and we can obtain the total number of stars and make an estimation of the total mass of 
such a cluster, (Ald us) = N x (m), (provided that we are in a regime where (m) can be 
safely used! but see lSelman & Melnick 2008). 

We also know from EVT the PDF of the ex t reme values for /V stars, so, we can 
ask what I/TV really means. Following Sornette ( 20041) the estimation of m max from 
l/N is in fact the stellar mass that is not exceeded with a probability p = 1/6 = 0.37 
for a cluster with N stars. It means that (1 - p)=63% of clusters with N stars (and 
hence (At c i us ) = N X (m)) have a star with mass larger than wj ma x- 

In other words, 

the estimation of the relation between m max and Al c ius as assumed in the IGIMF theory 
may not be correct in 63% of real cases. 



5. Conclusions 

In this contribution we have shown that the physical inferences obtained from CMDs 
studies are related with the results of evolutionary synthesis codes by the stellar lu- 
minosity distribution function. Since CMDs basically provide information about the 
distribution itself and the synthesis models typically just provide the mean of such 
distribution, a direct conclusion is that a CMD provides always a more complete infor- 
mation about star formation than the use of mean values obtained by synthesis models. 

The mean of the stellar luminosity distribution function is defined by its functional 
form as well as its limits (the upper one is non-trivially related to m up ). This mean as 
well as high-order moments are directly proportional to the mean integrated luminosity 
of ensembles of N stars. Such proportionality provides mass-luminosity relations for 
systems/observations with a large number of stars (such that the average total masses 
are larger than 10 8 M©) where the observed luminosity can be safely used as a proxy 
of the theoretical mean integrated luminosity. However, the use of one observation as a 
proxy of the mean of the theoretical distribution of integrated luminosities must be used 
with caution for less massive systems: it must be used together with the variance for 
ensembles with mean mass values in the range 10 5 to 10 8 M Q , and simply not used for 
ensembles with mean total masses smaller than 10 5 M . For these small (by number) 
systems ones has to use the distribution of integrated luminosities rather the parametric 
descriptions of the distribution. 

The reason of these limitations is the incomplete sampling (or size-of-sample ef- 
fects) of the stellar luminosity distribution function in the ensemble. Since the stellar 
luminosity distribution function contains a (low-luminosity) component related to the 
IMF, and a (high luminosity) component related to the short lifetime of fast evolution- 
ary phases, an intrinsic scatter due to random sampling is always present even in cases 
of a perfe ct or sorted sampling of the IMF More details about these limitations can be 
found in ICerviflo & Luridianal (12004 120051. |2006) and references therein. 



One of the difficulties related with size-of-sample effects is that the mean values 
of the distributions are not representative of the distributions themselves. Or, in a prac- 
tical way, observational data cannot be compared with the mean values provided by 
synthesis model to make physical inferences. This situation is especially relevant in the 
computation of total stellar masses of under-sampled clusters like the ones inferred to 
relate the total mass of a cluster with the mass of its more massive star. In these cases, 
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one has to use all observational constraints (such as the number of stars with different 
masses), run a large enough set of Monte Carlo simulations which fulfills observational 
constrains, and obtain the distribution of possible total masses. Any other methodology 
(such as to complete the unobserved mass range up to a given mass with a mean value 
of a truncated IMF) introduce biased results (just like an artificial sorted sampling). 

Finally, we show that the basic assumption implicit in the formalism of the IGIMF, 
that is, that when the integral of the IMF over some given interval (m*(A4 c ius) to m up ) 
is normalised to unity this provides the cluster mass Al c i us that contains just one star in 
that interval (and that this individual star has a mass equal to ra*(yVt c ius)) is not correct. 
Extreme value theory shows that such an assumption just provides a A4 c ius value which 
has a 63% of probability to have a star with a mass larger than m*(Alci us ). In this 
situation some of the basis of the formulation of the IGIMF theory must be revised. 
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